granite-vision-3.2-2b is a compact and efficient vision-language model specifically designed for visual document understanding, capable of automatically extracting content from tables, charts, infographics, and more.
Image-to-Text
Transformers English